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Abstract —In this paper, faulty successive cancellation decoding 
of polar codes for the binary erasure channel is studied. To 
this end, a simple erasure-based fault model is introduced to 
represent errors in the decoder and it is shown that, under 
this model, polarization does not happen, meaning that fully 
reliable communication is not possible at any rate. Furthermore, 
a lower bound on the frame error rate of polar codes under 
faulty SC decoding is provided, which is then used, along with 
a well-known upper bound, in order to choose a blocklength 
that minimizes the erasure probability under faulty decoding. 
Finally, an unequal error protection scheme that can re-enable 
asymptotically erasure-free transmission at a small rate loss and 
by protecting only a constant fraction of the decoder is proposed. 
The same scheme is also shown to significantly Improve the finite- 
length performance of the faulty successive cancellation decoder 
with negligible circuit overhead. 

Index Terms —Polar codes, successive cancellation decoding, 
faulty decoding. 


I. Introduction 

U NCERTAINTIES in the manufacturing process of inte¬ 
grated circuits are expected to play a significant role 
in the design of very-large-scale integration systems in the 
nanoscale era m-a. Due to these uncertainties, it will 
become more and more difficult to guarantee the correct 
behavior of integrated circuits at the gate level, meaning that 
the hardware may hecomt faulty in the sense that data is not al¬ 
ways processed or stored correctly. Moreover, very aggressive 
dynamic voltage scaling, which is commonly used to reduce 
the energy consumption of integrated circuits, can increase 
the occurrence of undesired faulty behavior ||4|- Traditional 
methods to ensure accurate hardware behavior, such as using 
larger transistors or circuit-level error correcting codes, are 
costly in terms of both area and power. 

Eortunately, many applications are inherently fault tolerant 
in the sense that they do not fail catastrophically under faulty 
hardware. A good example of such an application are com¬ 
munication systems, and more specifically channel decoders, 
since the processed data is already probabilistic in nature 
due to transmission over a noisy channel. Eaulty iterative 
decoding of LDPC codes was first studied in Q, where the 
Gallager A and sum-product algorithms are considered. Later 
studies also targeted the Gallager B algorithm Q and 
the min-sum algorithm ||8l, ||9l- All of the aforementioned 
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studies provide valuable insight into the limitations of LDPC 
codes under various decoding algorithms and fault models. 
Unfortunately, in many cases, the conclusion is that fully 
reliable communication is not possible when faults are present 
inside the decoder itself. Surprisingly, in some special cases, 
it has been demonstrated that faulty decoders can in fact even 
improve the error performance of LDPC codes in the finite 
blocklength regime 1101,11111. LDPC codes are usually studied 
with the help of random ensembles, meaning that a family 
of codes is studied rather than individual codes. Moreover 
there exists an infinite number of code ensembles with a given 
coding rate. Thus, it becomes unclear which code ensemble 
and which individual code should be studied. 

Polar codes ca constitute a different class of channel 
codes which has recently attracted significant attention, albeit 
not yet in the context of faulty decoding. Contrary to LDPC 
codes, a polar code for a given channel and coding rate is 
uniquely defined, thus greatly simplifying the choice of code 
to examine. Polar codes are provably capacity achieving over 
various channels and they have an efficient and structured 
successive cancellation decoding algorithm whose complexity 
is 0{N \ogN), where N is the length of the code. Moreover, 
encoding can also be performed with complexity that is 
0{N \ogN). When used for transmission over the binary 
erasure channel (BEC), the SC decoding algorithm can be 
highly simplified. Moreover, there exist analytical upper and 
lower bounds on the frame erasure rate (LER), which have 
been shown to be tight m and enable us to have a very 
good approximation of the LER without resorting to lengthy 
Monte-Carlo simulations. 

Contribution: In this paper we study successive cancellation 
decoding of polar codes for transmission over the BEC under 
an erasure-based internal fault model. We show that, under 
the fault model assumed in this paper, fully reliable commu¬ 
nication is no longer possible. Lurthermore, by studying the 
polarization process, we show that synthetic channel ordering 
with respect to both the channel erasure probability and the 
internal decoder erasure probability holds. We also adapt the 
lower bound on the LER derived in Ha to the case of faulty 
decoding, and we use it in order to derive the LER-optimal 
blocklength for a polar code of a given rate, and for a given 
channel and decoder erasure probability. Linally, we introduce 
a simple unequal error protection method, which is shown 
to re-enable asymptotically fully reliable communication by 
protecting only a constant fraction of the decoder. In the 
finite blocklength regime, our proposed fault-tolerance method 
significantly improves the LER with very low overhead. 
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Fig. 1; Synthetic channel construction for a polar code of 
length N = 2^ = 8. Pairs of solid lines represent the 
+ transformation and pairs of dashed lines represent the — 
transformation. 


Outline: The remainder of this paper is organized as fol¬ 
lows. Section|n]provides some background on the construction 
and decoding of polar codes. In Section [Hll we introduce the 
fault model that is used throughout this paper and we prove 
that fully reliable communication using polar codes is not 
possible under faulty decoding over the EEC. We also show 
some other useful properties of the faulty decoder. Moreover, 
in Section |IV] we adapt the lower bound on the PER derived 
in ifTSl to the case of faulty decoding, and in Section |V] we 
describe our proposed unequal error protection scheme. In 
Section IVIII we show numerical results concerning the PER, 
the optimal choice of blocklength, as well as the effectiveness 
of our proposed unequal error protection method. Pinally, 
Section I VIII I concludes this paper. 

Notation: We use the notation X = 1 — X. We use 
boldface letters to denote vectors, matrices, and strings. The 
n-th character of a string s is denoted by s„. We use log(-) 
to denote the binary logarithm. We denote the binary erasure 
channel with erasure probability p as BEC(p) and the ternary 
erasure channel with erasure probability p as TEC(p). We use 
0 to denote an empty string. Pinally, we use | • | to denote both 
the length of a string and the cardinality of a set. We use [a;] 
to denote the nearest integer of x (i.e., the rounding function) 
and \x'\ to denote the ceiling function. 

II. Polar Codes 

A. Polarizing Channel Transformation 

Let W denote a binary input memoryless channel with input 
u G {0,1}, output y € y, and transition probabilities 
W{y\u). The polarizing transformation proposed by Arikan 
generates 2"’ synthetic channels in n steps as follows. At step 1 
of the polarizing transformation, N independent copies of the 
channel W, denoted by k = 0,..., N—1, are combined 

pair-wise in order to generate N/2 independent copies of 
a pair of new synthetic channels denoted by and 


f}, k = 0,., N/2 — 1. The “H-” channels can be shown to 
be better, in terms of mutual information and Bhattacharyya 
parameter, than the original channel, while the channels 
are worse than the original channel. The same transformation 
is applied to and IE| J , k = 0,..., N/2 — 1 in order to 
generate N/A independent copies of ^2 

and W 2 \ fc = 0,..., N/4 — 1. This procedure is repeated 
for a total of n steps, until 2"’ channels lE^^g, s G {+, —}") 
are generated. Note that, in general, the notation W/ / implies 
that |s| = s. An example of the transformation steps is 
depicted in Pigure [T] for n = 3. 


B. Erasure Probability of Synthetic Channels 


Let ^ Z 


) denote the Bhattacharyya parameter 


of the synthetic channel When IE is a BEC(p), its 

Bhattacharyya parameter is equal to the erasure probability, 
i.e., Z = Z{W) = p. Moreover, all synthetic channels 

generated at step s are also BECs and their Bhattacharyya 
parameters (equivalently, their erasure probabilities) can be 
calculated recursively based on the Bhattacharyya parameters 
of the channels at step (s — 1) as ifT^ 


V-(s) 

s-l,k+2^- 


7(s“) yN) 

^s,k ~ ^s-l,k 

7(^-1-) 7(3) 7(3) 

■^s,k s-l,fe s-l,fe+2"-® 


_ yi^) yi^) /i'\ 

^s-l,k'^s-l,k+2^-‘’ 

( 2 ) 


where s = l,...,n, k = 0,...,2"“® — 1. The channels 
k = 0,..., 2”“® — 1, are independent copies of the 
same type of channel, meaning that their erasure probabilities 
are identical. Thus, if we are only interested in the erasure 
probability of a specihc type s of channel we can simplify ([T]) 
and (|2]) by omitting the index k as 

^(s-) ^ A ' , (3) 

Z(^+)=T+(zil\) 4 (4) 

with Zq^'^ = p. The vector containing all zi^\ s G {+,—}*, 
variables is denoted by Zg. 

Moreover, as in m, m, we dehne the polarization 
random process Cn as 

= (5) 


with P [S = s] = i.e., Cg is equally likely to be equal 

to the erasure probability of any of the 2® distinct types of 
synthetic channels at step s of the polarizing transformation. 
The random process eg can be written equivalently as 


f T (eg_i) w.p. 1/2, 
\ r+(eg_i) w.p. 1/2 


with eg = .^(lE) = p. It was shown in IfT^ that Cg converges 
almost surely to a random variable Coo G {0,1}, with P{eoo = 
0) = F(IE) = 1 — p, where I{W) denotes the symmetric 
capacity of the BEC W. 

Pinally, let us dehne a binary erasure indicator variable 

(s') * 

for which £"/ / = 1 if and only if the output of the synthetic 

(s') (s') 

channel IE/ / is an erasure and / = 0 otherwise. It is 
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clear that E 


E' 


= The indicator variables can also 


be determined recursively as follows ifTSll 

z?!®) z?!®) 

n(s) 


^s,k — ^s-l,k -C's-l,/c+2'‘-'» ^s-l,k^s-l,k+2’> 


77’(® + ) Z 7 I®) Jpk 

^s,k ''^s ——l,fc+2"-'’■ 


(7) 

( 8 ) 


Similarly to the Bhattacharyya parameters, if we are only 
interested in the statistics of the indicator variable for a channel 
of a specific type s, we can simplify (|7]l and ® as 


^(s-) ^ ^ , (9) 

i?(^+) = Ei^},'Ei^},'\ (10) 



Fig. 2: Synthetic channel construction for a polar code of 
length TV = 2^ = 4. Solid lines represent the + transformation 
and dashed lines represent the — transformation. 


(s) ' (s) " 

where E^J-^ and denote two independent realizations 

/g\ __ fg't 

of El_-^ 11131 . The vector containing all Es ^ indicator vari¬ 
ables is denoted by Eg. 


W are combined pair-wise through through a full binary tree 
structure of depth n that is identical to the channel combining 
structure of Figure [T] Let /+ and /~ be defined as 


C. Construction of Polar Codes 

Let us define a mapping from s G {+,—}” to the integer¬ 
valued indices i G {0, ...,2" — 1} as follows. First, we 
construct b by replacing each — that appears in s with a 
0 and each -f that appears in s with a 1. Then, the index 
i can be obtained by considering b as a left-MSB binary 
representation of i. As this mapping is a bijection, we use 
s and i interchangeably. 

Let us fix a blocklength N — 2"^ and a code rate R = 
0 < K < N. Moreover, let A denote the set of the K 
channel indices i (equivalently, strings s) with the smallest 

(s) 

Zn . A polar code of rate R is constructed by transmitting 
the information vector over the K best synthetic channels, 
while freezing the input of the remaining synthetic channels, 
i.e., to a value that is known at the receiver. This can 
be achieved by transmitting the encoded codeword x = uG„ 
over 2" independent uses of the initial BEC W, where 


and B„ denotes the bit-reversal permutation matrix ifT^ . Due 
to the structure of G^, encoding can be performed with 
complexity 0{N log N). If R < I{W) = 1 — p, then as n 
is increased, all synthetic channels W^^q, s G yf, become 
arbitrarily good and the polar code is capacity achieving mi. 


Gn = B„F 


(gin 


F = 


D. Successive Cancellation Decoding of Polar Codes 

Without loss of generality, we assume the output alphabet 
of the BEC kF to be 3^ = {—1,0,4-1}, where 0 denotes 
an erasure, while —1 corresponds to the binary input 1 
and 4-1 corresponds to the binary input 0. The successive 
cancellation (SC) decoder proposed by Arikan decodes the 
synthetic channels , s G An, successively following a 
natural ordering with respect to i (this is equivalent to a top- 
down decoding order of the VFg}], s G {4-, —channels in 
Eigure [T]). The channels s ^ An do not need to be decoded 
as, by construction, their input is know a-priori to the receiver. 
In order to estimate the input of the synthetic channel Wn^\ 
the 2" channel outputs resulting from 2" independent uses of 


/ (toi,TO 2) = TOim2, (12) 

(-1)“toi- f TO 2 I 

2 ’ ^ 

where u denotes a partial sum, which is the modulo-2 sum of 
some of the previously decoded bits and we use [—0.5] = 0 
and [0.5] = 0 for tie-breaking. The partial sums required by 
each the /+ nodes at level s can be calculated recursively from 
the partial sums at level s4-1 lITSll . If Sg = —, then all updates 
at level s of the tree are performed using f~, while if Sg = 4-, 
then all updates at level s of the tree are performed using 
/+. When level n is reached, the output message will either 
be correct (i.e., —1 or 4-1), or an erasure. If the final output 
message is correct, we can derive the corresponding bit value 
for Ui and proceed with decoding. If the final output messa^ 
is an erasure, the decoder halts and declares a block erasurel[l 
By re-using intermediate synthetic channel outputs, it can be 
shown that the complexity of SC decoding is 0{N log W) ifT^ . 


/+(TOi,m2,u) = 


III. Eaulty sc Decoding of Polar Codes 

All current SC decoder hardware implementations (e.g., 
Ei-mi) require a full binary tree of memory elements (MEs) 
of depth n, which store the messages that result from the 
update rules at each level of the decoder tree. The total number 
of MEs required by a decoder is 

n 

iVME = = 2"+^ - 1 = 2Af-1 G 0(W). (14) 

The processing elements (PEs), which apply the update rules, 
can also have a full binary tree structure for a fully-parallel 
implementation 021, although semi-parallel implementations 
are also possible oa.A fully-parallel implementation requires 
N—1 PEs, while in a semi-parallel implementation the number 
of PEs is restricted to P < N — 1. 

*We note that in the latter case the decoder could make a random decision 
and attempt to continue decoding, although this will only improve the block 
error probability by at most a factor of two. 
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r(s) 


— £/„ , . <:£/ 


,fc,5 s —l,fc,5 ^ s—l,/c4-2^-^,5 s —l,fc,5 s —i,fc+2^—~ \ s —l,fc,5 ^ s —l,fc+2^-®,5 s —l,fc,5 s—l,/c+2'^-^,5 


r(s) 


r(s) 




s./c ’ 


^(s + ) _ g(s) p,(s) 


^(®) 


,k,S s — l,k,5 a-l,k+2'^-’’,S I s—1,^,5 s —l,fc+2"-'*,5 / s,fc 


. (s+) 


(15) 

(16) 


A. Fault Model 

We model faulty decoding as additional internal erasures 
within the decoder, which may be caused either by faulty 
PEs or by faulty MEs (or both) and we assume, without 
loss of generality, that they manifest themselves when an 
output message is written to an ME. Moreover, we assume 
that these erasures are transient in the sense that whenever 
an ME is written to, the internal erasures occur independently 
of any previous internal erasures. The partial sums, which are 
required by the /+ update rule, also need to be stored in a 
memory, which however is typically smaller than the memory 
required to store the messages. Moreover, due to the partial 
sum recursive update rules ifT^ . a single erasure in a partial 
sum will result in erasures in all following partial sums and we 
can intuitively see that the sensitivity of the SC decoder with 
respect to faults in the partial sum memory is high. Thus, in 
this work we assume that the partial sum memory is fault-free. 

Under the above assumptions, the internal erasures occur 
at the output of all synthetic channels of a polar code of 
blocklength n, i.e., s = 1,... ,n, s G {+,—}*, k = 

0,..., 2"“'* — 1. Moreover, the internal erasures occur inde¬ 
pendently of the message value and with probability S. Let 
us define a ternary-input erasure channel (TEC) with input 
alphabet X = {—1,0,-1-1} and output alphabet y = X and 
the following transition probabilities 

P[0|0] = 1, (17) 

P[0|-1] =P[0|-fl] =5, (18) 

P[-Pl|-Pl] =P[-1|-1] = 1-5, (19) 


where the probabilities of all remaining transitions are equal 
to zero. 

Using the above TEC, our error model can be represented 
as a cascade of a EEC with a TEC, as shown in Eigure |2] 

(s) 

where results from the non-faulty polarizing channel 


transformation applied to a pair of channels ^ and 

(where t is a prehx of s) and “TEC” represents 


(t) 


^s-l,fe+2" 

the internal erasures caused by the faulty SC decoder. We 
denote this cascaded compound channel by s order to 
make the dependence on 5 explicit. It is easy to check that 
for (5 = 0 we get a non-faulty decoder, while for (5 = 1 
all messages are always erasures leading to a fully faulty 
decoder. Thus, it is mainly interesting to study the decoder 
for (5 G ( 0 , 1 ). 

In order to have a more rigorous definition of the internal 
erasure fault model, let us dehne the binary erasure indicator 
variable where = 1 iff the TEC that comes 

after in Eigure |2] causes an internal erasure at channel 


W. 


s,k 

and A^^], = 0 otherwise. By definition, we have 


s,k ’ 

=1 


= S, thus E 


yi 


= S and var 




(5(1—^). Since the internal erasures are assumed to be transient, 
all are independent. Due to the cascaded BEC-TEC 

structure, we can rewrite (|7]) and ([ 8 ]l using A^fl as (fTsT i and 
(Ull. Again, if we are only interested in the statistics of the 
indicator variable for a channel of a specihc type s, we can 
simplify (fTSl) and (fThl l as 


p(S“) _ p(®) ' I p(®) " _ p(®) ^p(®) " 

+ A(- 

Ei7^ = A<f+\ (21) 


( 20 ) 


fs) ^ (s') 

where ^ and ^ denote two independent realiza- 
tions of El_-^ g and As and A} ^ denote a realization of 
Ag^^^ and respectively. The vector containing all E^^j 

indicator variables is denoted by Eg, 5 . 

We note that in a fully-parallel implementation, each ME 
has a corresponding PE, and our erasure-based fault model 
can take erasures in both the MEs and the PEs into account 
simultaneously. In a semi-parallel implementation, on the other 
hand, the MEs are signihcantly more than the PEs (i.e., 
typically P ^ 2N — 1 , as in ifTSll where N = 1024 and 
P = 64), so it is reasonable to assume that faults stem only 
from the MEs, as the PEs can be made reliable with circuit- 
level techniques at a relatively low cost. 


B. Erasure Probability of Synthetic Channels Under Faulty 
SC Decoding 


Using the fault model introduced in the previous section, 
we can rewrite the recursive expressions for Z'fl. (i.e., ([T]) and 
(| 2 |i) in order to obtain a recursive expression for the erasure 
probability of the synthetic channels in the faulty case, which 


we denote by ^ = E 


E 


(s) 

s,k,5 


. Specihcally, we have 


7 )®”) _ 7 )®) I 7 )®) _ 7 )®) 7 )®) 

^s,k,5 ^s—l,k,S "^s —l,fc-|-2"-“,(5 —l,fc-|-2"-'*,5’ 


1 ( 7 ( 3 ) I 7 ( 3 ) 

l,fe,5 ^s — l,k+2''-‘ ,S 


_ 7 )®) 7 )®) 


( 22 ) 


(®+) _ 7(®) 


ZTks = 


7 )®) I ( 7 (®) 7 )®) r 

s-l,k,S^s-l,k+2^-‘,S ^ s-l,k,S^s-l,k+2^—> ,S) 


(23) 


with 5 = Pi fc = 0,..., 2" — 1. The channels 51 k = 
0 ,..., 2 "“® — 1 , are independent copies of the same type of 
channel, meaning that their erasure probabilities are identical. 
Thus, if we are only interested in the erasure probability of 
a specific type s of channel we can simplify (IB and (Ell by 
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omitting the index k as 



(24) 

(25) 


with = p. The vector containing all s G {+,—}*, 

variables is denoted by Zg , 5 . The random process Cg can be 
rewritten for the faulty case as 


rT+(eg_i. 5 ) w.p. 1 / 2 , 
[Tg {es- 1 , 5 ) w.p. 1 / 2 , 


(26) 


with Co ,5 = Z{W) = p. 


Property 4. For the expectation of the process es, 5 , s = 
0, 1 ,..., defined in (126b we have 

E(eg,^) = l-(l-eo)(l-<5)^ (34) 

Proof: From the proof of Property [2 we know that 

E(eg,5|es_i,5) = eg_i,5 + (1 - es_i,5)(5. (35) 

By taking the expectation with respect to es-i ,5 on both sides 
of (I35ll, we have 

E(eg_5) = E(eg_i,5) + (1 - E((es_i,5))(5 (36) 

= (1 - <5)E(eg_i.5) + <5, (37) 

with E(eo, 5 ) = £ 0,5 = p. The solution of this recurrence 
relation is 


C. Properties ofT^ and Tg 

In this section, we show some properties of the and Tf 
transformations, which will be useful to prove two negative 
results in the following section, as well as to interpret some 
of the numerical results of Section Ivnl 

Property 1. For T/*'(e) and Tf{e), we have 

(i) T+{e)>S, Ve,JG [0,1], 

(ii) Tg-ie)>S, Ve,5G [ 0 , 1 ], 

Proof: For T'g (e), we have 

+ (1 - £ 2 )^ >5^ (27) 

(1 - 5)e^ > 0 , (28) 

which indeed holds for any e, <5 G [0,1]. Similarly, for Tf{e), 
we have 

2e - + (1 - 2e + ^ (29) 

(1-5)(2e-e2) > 0, (30) 

which indeed holds for any e, <5 G [0,1]. ■ 

Property 2. The fixed points ofT^ (e) are e = 1 and e = 

The unique fixed point ofTf{e) for e G [0,1] is e = 1. 

Proof: The above property can easily be shown by solving 
(e) = e and Tf (e) = e for e, respectively, and noting that 
one solution of Tf{e) = e is negative. ■ 

Moreover, the following two properties of the process es,s 
give us some first insight into the effect that the faulty decoder 
has on the decoding process. 

Property 3. The process €s,s, s = 0,1,..., defined in ((26) 
is a submartingale. 

Proof: Since eg 5 is bounded, it holds that E(|eg 5 ]) < 00 . 
Moreover we have 

Eies,s\es-i,s) = ^ {Tg+ies-ps) + ^.-(eg-i.i)) (31) 

= 2 + 2es-i,5 

+ (1 — 2eg_i^5 + eg_i ,5)(5) (32) 

= Cg—i^^ T (1 — eg_i^5)(5 > Cg—i^^. (33) 


E(eg.^) = l-(l-p)(l-,5)^ (38) 

■ 

Specifically, this tells us that, contrary to Ha, the average 
erasure probability is not preserved by T^{e) and Tf{e). 
Thus, even if fully reliable transmission were possible in the 
limit of inhnite blocklength, the polar code would not be 
capacity achieving since P [cg ^ = 0 ] < 1 — p, meaning that 
the fraction of noiseless channels would be strictly smaller 
than the capacity of the BEC. 

D. Polarization Does Not Happen 

Unfortunately, as the following proposition asserts, fully 
reliable transmission under faulty decoding is not possible. 

Proposition 1. Let Q denote the sample space of the process 
Cs^S ond let eg ^(g), q G S, denote a specific realization of 
Cs^s- Polarization does not happen under faulty SC decoding 
for the BEC in the sense that $q £ Q such that Eg ,5 ( 9 ) ^0. 

Proof: This is a direct consequence of Property [T] since 
all es^s{q) are produced by repeated applications of T'g and 
Tf to £0,5 = p, so that Eg, 5 ( 9 ) > 6 , yqC Q. ■ 

It turns out that we can prove the following stronger result, 
which states that, under faulty SC decoding over the BEC, 
almost all channels become asymptotically useless. 

Proposition 2. For the process eg, 5 , s = 0,1,..., defined in 
(126) . we have £ 5,5 21 :^ 1 . 

Proof: Erom Property [3] we know that £g ,5 is a bounded 
submartingale. Thus, it converges a.s. to some limiting random 
variable Coo- Moreover, from Property [4] we have 

E(£g, 5 ) = l-(l-p)(l-J)^ (39) 

which directly implies that limg^oo E(£g, 5 ) = 1 , since, by 
assumption, 5 G (0,1). Equivalently, and since £g ,5 G [0,1], 
we can write 

lim E(|£g ,5 - 1|) = 0, (40) 

S—¥00 

which means, by definition, that £ 5,5 —1. Moreover, 

P 

£g ,5 —1 implies that £g ,5 —> 1. Since we know, due to 
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the submartingale property, that Cs^s also converges almost 
surely and almost sure convergence implies convergence in 
probability, all the aforementioned limits must be identical and 
we can conclude that eg^s 1 - ■ 


E. Synthetic Channel Ordering 

In the case of non-faulty decoding, there exists a partial 
ordering of the synthetic channels with respect to the BEC 
erasure probability p. In order to explain this ordering, we 
first need to define the notion of “ 77 -goodness”. 

(s') 

Definition 1. A synthetic channel Wg is said to be “ 77 -good” 
if < 77 . 

In the non-faulty case, it is easy to see that both T'^{e) 
and T~{e) are increasing in e, Ve G [0,1]. Thus, a synthetic 
channel that is 77 -good for a BEC with erasure probability pi, 
will also be 77 -good for a BEC with erasure probability p 2 
when p 2 < Pi ■ 

In this section, we show that under faulty decoding the 
partial ordering with respect to the BEC parameter p is 
preserved and we show that a similar partial ordering exists 
with respect to the decoder erasure probability 5. To this end, 
in the following two properties we examine the monotonicity 
of (e) and (e) with respect to e and 5. 

Property 5. Both T^[e) and T^{e) are 

(i) Increasing in e, Ve G [0,1]. 

(ii) Increasing in S, \/6 G [0,1]. 

Proof: (i) (e) can be re-written as 

T+{e) = e^ + {l-e^)5 (41) 

= e2(l-5)-fA (42) 

Thus, for any fixed <5 G [0,1], T'^(e) is clearly increasing in 
e for any e G [0,1]. Similarly, Tf~{e) can be re-written as 

T^{e) =2e-e^ + {l-2e + e^)6 (43) 

= (2e-e2)(l-5)-f A (44) 

Thus, the partial derivative of Tf{e) with respect to e can 
easily be calculated as 

= 2(1 - e)(l - 5), (45) 

which, for any fixed 5 G [0,1], is non-negative Ve G [0,1]. 


(ii) Both (e) and (e) are linear functions of 5 
with a non-negative coefficient, so they are increasing 
V(5 G M. ■ 


Proposition 3 (Monotonicity with respect to p). Let pi,p 2 G 
(0,1), P 2 < Pi and S G (0,1). A synthetic channel that is 
r]-good for a decoder with a fixed erasure probability S over 
a BEC with erasure probability pi is also p-goodfor the same 
decoder over a BEC with erasure probability p 2 . 


Proof: The erasure probability of any synthetic channel 

(s') — 

g can be calculated by repeated applications of Tg and 
starting from p as 


zSip)=Tg^^ {Tr'i---(Tg^^ip)))), 


where s = [s^, s^-r, ■ • ■ , si] and Si G {-|-,-},7 = 

Since from Property |3i) we know that both Tf{e) and (e) 
are increasing with respect to e, any composition of the two 
functions will also be increasing. Thus 

Zf}{p 2 )<zf}{pi)<p. (47) 


The following proposition states that there also exists a 
partial ordering of the synthetic channels with respect to the 
decoder erasure probability <5. This is a useful property, as it 
ensures that, for any given polar code, a decoder with internal 
erasure probability 82 will not perform worse than a decoder 
with internal erasure probability 5i, where ^2 < <5i. 

Proposition 4 (Monotonicity with respect to 5). Let , <52 G 
(0,1), 82 < ^r (ind e G (0,1). A synthetic channel that is 
rj-good for a decoder with erasure probability (5i over a BEC 
with a fixed erasure probability e is also p-goodfor a decoder 
with erasure probability 82 over the same channel. 

Proof: Similarly to the proof of Proposition [2 the proof 
stems directly from the monotonicity of Tf (e) and T'g (e) with 
respect to <5 shown in Property |5lii). ■ 


IV. Erame Erasure Rate Under Eaulty Decoding 

In this section, we adapt the framework of m to the case 
of faulty decoding in order to derive a lower bound on the 
frame erasure probability under faulty decoding. Let Pe{An) 
denote the frame erasure rate (EER) of a polar code of length 
2" with information set An- Erom mi, we have the general 
upper bound 

PeiAn) < Y, 

Eurthermore, from iflBll we have the lower bound 

Pe{An) >YZ^^-\ E 4 pLB 

sGAn S,tGAn: 

S^t 

(49) 


where Cn = : s,t G {+,—}”] denotes the co- 

variance matrix of the random vector E„, where — 

cov[i?i*^i?i*^]. It was shown in lfT3ll that, in the non-faulty 
case, the elements of Cs,s = 1,...,77, can be calculated 
recursively from the elements of C^-r and as follows 


^(s—,t+) _ 

^(s+,t—) _ 2 Z^^'^ Z^^^ 

C7(s+,t+) ^ 2ZYz^gliCfZ’ + cEr^", 


(50) 

(51) 

(52) 

(53) 


with C, 


( 0 , 0 ) 


= p{l — p). In the case of reliable decoding, the 
second sum in ( |49] | goes to zero as n is increased fl3l if 
R = < 1 — p, so that 


PeiAn) 


S&An 


(54) 


(46) 






7 


We can use the upper and lower bounds of ( |49] ) and ( l48l l for 


the case of faulty decoding by replacing with and 

with Ciy\where cj’,*) 4 [cjf : s, t e {+, -}"], is 
the covariance matrix of the random vector E„ , 5 . In the case of 
faulty decoding, as n is increased, we know from Proposition|2] 
that almost all s, t G An, are equal to 1 . Moreover, 

the non-diagonal elements of C!^g' still converge to 0 for 
any s, t, as almost all indicator variables become deterministic 
like in the fault-free case. Thus, for some n the lower bound 
of (|49] | becomes negative and can be replaced by the trivial 
lower bound Pe{An) > maxgg^^ zj^g. Similarly, for some 
n the upper bound of ( l48l l becomes greater than 1, so it can 
be replaced by the trivial upper bound Pf.{An) < 1- Clearly 
though, since Z^ g converges to 1 as n grows for almost all 
s G {+, —we have lim„_^oo -Pe(-4„) = 1 for any An such 
that lim„^oo -f* 0 . 


^(s) 


A. Lower Bound on Pe(An) Under Faulty Decoding 

is) 

We already have an efficient way to calculate Z^ g recur¬ 
sively (i.e., (l24li and (l25]lk but, in order to evaluate P^^, we 
still need to find an efficient way to calculate Cn,s- To this 
end, we first introduce a property which we then combine with 
the results of m in order to obtain a recursive expression for 

j ^ — 1 , . . . , 71 . 

Property 6. Let X, Y denote two arbitrary random variables. 
Let Ai,A 2 denote two random variables with Ai,A 2 G 
{0,1} and E [Ai] = E [A 2 ] = 5 that are independent of X, Y 
and of each other. Then, we have 


cov [X + {1- X)Ai,Y -f (1 - F)A 2 ] = (1 - Sfcov [X, F]. 

(55) 


Proof: For simpler notation, let us define X' A X Y [1 — 
A)Ai and Y' A Y + {\ — F)A 2 . We then have 

cov [A', Y'] = E[A'F'] - E[A']E[F'] (56) 

= E[(l - Ai)A + Ai)((l - A2)F + A 2 )] 

- E[(l - Ai)A + Ai]E[(l - A2)F + A 2 ] 

(57) 

= E [(1 - Ai)(l - A 2 )] (E[AF] - E[A]E[r]) 

(58) 

(1-5)2cov[A,F], (59) 


where for (*) we have used the independence of Ai and A 2 
from X and Y, while for (*= 1 =) we have used the independence 
between Ai and A 2 . ■ 


Proposition 5. The covariance matrix of the random vector 


denoted by 
p j A r p(®) p(t) 


~ [C's.S 


s,t G {+,—}*], where 
can be computed in terms of Cg-i ,5 


and Zg_i ,5 as follows: 

= (1 - < 5 )' + A-i/) 


(60) 


(61) 


= (1 - - cti /) 

( 

p(s+,t —) r\2 (^ y{^) y{t.) ^(s,t) ^(s,t) 

^s,S — (-*- c) \^^yis_i g^s_i gFg_j^ g J , 

(62) 

^(s+,t+) /I r\2 ft) yis) y{t) ^(s,t) ^(s,t) 

^s,<5 — (^ c) \^ttZ^_)gZ^_)gC^_)g + L,,_)g j, 

(63) 

with =p(l — p)- 

Proof: To avoid unnecessary repetition, we prove the 
result only for (l63t . as the remaining relations (I60l l- (l62l i can 
be derived in the same way. Recall that, in the case of faulty 
decoding, from (ISTT l we have 

U,^g - F,_i^g Fs-I,s + [^ - ^S-I,s A 

(64) 

AP = ePJAIJ' + (1 - ePJeAP) 


Let us define A A eAAPiJ'^ Y = E^Je^P^ 
a(s+) a a A 2 . Then, we can rewrite as 

= A + (1-A)Ai, ( 66 ) 

eA = Y + {1-Y)A2, (67) 


i(s-l-) 


where A and Y are identical to the update rule for E„ 
and in the fault-free case given in (fTol i. respectively. 


Using E 


(s+) 


= E 


(t+) 


= 5, along with the fact that 


aI*^^ and aI*^ are independent by assumption, we can apply 
Proposition | 6 ] to the update formula for cov [A, Y] from lITSl 
given in (l5^ . in order to obtain (l63t . ■ 

It is intuitively pleasing to note that, for i5 = 0 (i.e., for fault- 
free decoding), the expressions in (l60l) - (l6^ become identical 
to the expressions in (I50l l- (l53t . 


V. Unequal Error Protection 

As mentioned in Section HI standard methods employed to 
enhance the fault tolerance of circuits, such as using larger 
transistors or circuit-level error correcting codes, are costly in 
terms of both area and power if the entire circuit needs to 
be protected. With this in mind, we note that in SC decoding 
of polar codes not all levels in the tree of PEs are of equal 
importance, meaning that it may suffice to employ partial 
protection of the decoder against hardware-induced errors. In 
fact, we shall see in Proposition | 6 ] a careful application of 
such a protection method allows polarization to happen even 
in a faulty decoder while protecting only a constant fraction 
of the total decoder PEs. 

Let Tip denote the number of levels that are protected, 
starting from level n of the tree (i.e., the root) and going 











towards the leaves. We assume that for these rip levels we 
have (5 = 0. Let iVp denote the total number of protected PEs, 
where 


^P = 



23 = 2"p - 1, 
0 , 


Tip ^ 0, 

Tip — 0 . 


( 68 ) 


If we set Tip = (n + 1 ) — n^, where tTu > 0 is a fixed number 
of unprotected levels, then the fraction of the decoder that is 
protected converges to a constant as n grows. Indeed, we have 


AT — 1 

lim —^ = lim ———]- 

n—^oc WpE n—loc — 1 


= 2 “"'". 


(69) 


In this case, the process ,5 can be rewritten as 


resulting polar code becomes asymptotically useless. However, 
there must exist at least one blocklength which minimizes the 
FER and it is of great practical interest to identify this length. 

Since this is a finite-length problem with practical appli¬ 
cations, there will usually be a pre-defined maximum block- 
length riniax for which a decoder is implementable with 
acceptable complexity. Thus, for a given rimax, we define 
Af = {0,... jrtinax} as the set of n values of interest. Eor 
a given code rate R, we define the n* which leads to the 
optimal blocklength N* = 2" as 

n* = arg min Pe(-4„). (75) 

rtGJV" 


if S = 1, . . . , TTu, 

if s = riu -f 1,..., n. 

(70) 

The following proposition asserts that the protection of a 
constant fraction of the decoder is sufficient to ensure that 
polarization happens as n grows. 

Proposition 6. Setting rip = s — riufor any fixed rij, suffices to 
ensure that converges almost surely to a random variable 
£00 S {0, !}• However, the unprotected levels result in a rate 
loss AR( 6 ,p,nu), in the sense that P{eoo =0) = 1 — p — 
AR(S,p,nu), which can be calculated in closed form as 

AR{S,p, n„) = (!-(!- 5)”")(1 - p). (71) 

Proof: The process es,s as defined in (iTOl l is a submartin¬ 
gale for s < ria, but it becomes a martingale for s > n^^. Thus, 
for s > riu we have E(es^ 5 ) = Using the arguments 

from ca, we can show that es ,5 converges almost surely to a 
random variable too € {0,1} with P(eoo = 0) = 1—E(e„J < 
1 — p. Equivalently, P(eoo = 0) = 1 — p — AR{5, e, riu) for 
AR{ 6 , e, nf) = E(e„J — p. Using the closed form expression 
for E(es, 5 ) from Property]?] we get 

AR{ 6 , p, riu) = E(e„J - p (72) 

= l-{l-p){l-6r"-p (73) 

= (l-(l-5)”“)(l-p). (74) 


['r+(es_i, 5 ), w.p. 1/2, 
, ^ J'77(es-i,5)( w.p. 1/2, 

] T+{es-i,s), w.p. 1/2, 
[t {es-i,s), w.p. 1/2, 


Proposition | 6 ] implies that, when partial protection of the de¬ 
coder is employed, polar codes are still not capacity achieving, 
but they can nevertheless be used for reliable transmission at 
any rate R such that i? < 1 — p — AR{S,p, riu). 


A simple way to identify the optimal blocklength is to 
perform extensive Monte-Carlo simulations of the codes for 
all n G Af. However, we can find the solution more efficiently 
by using the bounds on Pe{An) given by (l48l l and (|49] |. First, 
we study the special case where p < S. More specifically, the 
following proposition shows that, when p < 5, it is optimal in 
terms of the FER to use uncoded transmission, as the faulty 
decoder can only increase the FER. 

Proposition 7. If p < S, then n* = 0. 

Proof: The FER for n = 0 (i.e., uncoded transmission) 
over a BEC(p) is equal to p. From Property [T] we know that 
Z^\ > 5, Vs G {+)—}"■ Since p < (5 by assumption, we 
have > p, Vs G {-f, —}". Thus, using the trivial lower 
bound on the FER, i.e., P^^ = maxsg^„ we can see that 
P/"® > p for any An such that |^„| > 0. Thus, in this special 
case coded transmission with any blocklength such that n > 0 
and at any rate R > 0, leads to a higher FER than uncoded 
transmission. ■ 

In general, we can efficiently evaluate P]f^{An) and 
P^^{An) for all n G A/” for a given rate R. Using these 
values, we can deduce whether there exists a single n G Af 
satisfying the following inequality 

P^^{An) < Pe^{An'), Vn' G U. (76) 

If there exists such a unique n G N, then clearly this is the 
optimal n* . Otherwise, we need to examine (via Monte-Carlo 
simulations) all n G A/" for which P^^{An) and 
overlap, i.e., for which 3n' G Af and 3B G {UB,LB} such 
that 


Pi^ {Arc ) < Pf (An) < P^^ {An' ) ■ (77) 


VI. Optimal Blocklength Under Faulty Decoding 

In the finite blocklength regime, which is of practical 
interest, there are two clashing effects occurring. On one side, 
we have the polarization process, which tends to decrease the 
code’s FER as the blocklength is increased, but on the other 
side we have the internal erasures of the decoder which tend 
to increase the code’s FER as the blocklength is increased. 
From Proposition |2| we already know that, as the blocklength 
is increased towards infinity, the latter effect dominates and the 


VII. Numerical results 

In this section we provide some numerical results to explore 
the process es,s, as well as the FER performance of polar 
codes constructed based on this process. Moreover, we use 
the FER bounds derived in Section |IV] in order to find the 
optimal blocklength for polar a polar code under faulty SC 
decoding and we explore the effectiveness of the unequal error 
protection scheme described in Section |V] 
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values for polar codes of length N = 256,1024,4096, 
designed for the BEC(0.5) under faulty SC decoding with 
S = 10“® and non-faulty decoding, respectively. 



Rate 

Fig. 4: Evaluation of and for polar codes of lengths 
N = 256,1024,4096, designed for the BEC(0.5) with S = 
10 -®. 


Remark: Most of the results in this section are persented 
for a decoder erasure probability of i5 = 10“®. From Prop¬ 
erty [T] we know that the erasure probability of the synthetic 
channels is lower bounded by 5. Moreover, from (|48]) . we 
know that the frame error rate is upper bounded by the sum 
of the erasure probabilities of the synthetic channels used to 
transmit information. In the numerical experiments we did, we 
saw that the same number also provides a good lower bound 
for most code rates. Thus, have we selected 5 = 10“® as this 
leads to frame error rates that are practically relevant for the 
blocklengths that we have considered. 

(s) 

A. Bhattacharyya Parameters g 

In Figured we show the sorted values Z^'^g, s e {+, —}”, 
for polar codes with n = 8,10,12, designed for the BEC(0.5) 
under faulty SC decoding with S = 10“®. We observe that we 
always have Z ^^^ > <5, as predicted by Property [T] Moreover, 
e = is a fixed point of (e), but it is not a fixed point 
of T^{e) (whereas e = 1 is a fixed point for both), resulting 
in the staircase-like structure that we can observe in Figure |3] 

B. Frame Erasure Rate 

In Figure |4] we present the evaluation of P^F® and P^F® as 
a function of R and for N = 256,1024, 2048, for a faulty SC 
decoder with <5 = 10“® and transmission over the BEC(0.5). 
We observe that, especially for low rates, and P^® are 
practically indistinguishable. For rates R > 0.30 we start 
observing a difference between the lower bound and the upper 
bound, while for R > 0.40 both the upper bound and the 
lower bound break down and should be replaced by their trivial 
versions P^^ = 1 and Pj"® = maxsg^„ ^nS- Moreover, we 
observe that over a wide range of rates the FER under SC 
decoding actually increases when the blocklength is increased, 
contrary to the fault-free case where increasing the blocklength 


generally decreases the FER. This can be explained if we 
recall that Z^l > S. Thus, by increasing the blocklength while 
keeping the rate fixed, we are increasing the number of terms 
in dH, and since some of these terms do not decrease beyond 
some point, the value of the sum can increase. 

C. Optimal Blocklength 

An example of the evaluation of P^F® and Pj"® for 
N = 2", n = 4,..., 12, and code rates R S 

{0.1250,0.1875,0.2500} (where K = [PiV]) is shown in 
Figure |5] under faulty SC decoding with <5 = 10“® over a 
BEC(0.5). We observe that the bounds are tight enough in 
this case so that there always exists a unique n G JV that 
satisfies dThl l. Thus, for R = 0.1250 the optimal blocklength 
is = 128, for R = 0.1875 the optimal blocklength is 
N = 256, and finally for R = 0.2500 the optimal blocklength 
is IV = 512. 

D. Unequal Error Protection 

The effect of the partial protection for a finite length code 
is illustrated in Figure |6] where we present P^^{An) for 
N = 2^° = 1024 and 5 = 10“® when rip = 0,..., 5, levels of 
the tree are protected. To improve readability, we intentionally 
omit Pl‘^{An) from the figure. However, we have already 
seen that the bounds are tight, especially for low rates, so 
using only the upper bound is sufficient to illustrate the effect 
of unequal error protection. We observe that protecting only 
the root node already improves the performance significantly, 
especially for the lower rates. When rip = 5, the performance 
of the faulty SC decoder is almost identical to the non-faulty 
decoder in the examined FER region and it is remarkable that 
this performance improvement is achieved by protecting only 
~ 1.5% of the decoder. Moreover, in Figure |7] we 
present P^^{Ai) for N = 512,1024,2048, and 5 = 10"® 
with rip = n — 5, so that the protected part for each N 
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Blocklength (n) 


Fig. 5; Evaluation of and for 

N = 2", n — 0,..., 12, and various code rates 

R G {0.1250,0.1875,0.2500} for transmission over a 
BEC with erasure probability 0.5 under faulty SC decoding 
with S = 10“®. 



Eig. 6: EER for a polar code of length N = 1024 designed 
for the BEC(0.5) under faulty SC decoding with d = 10“® 
and Up = 0,..., 5, protected decoding levels. Protecting rip = 
n + 1 levels is equivalent to using a non-faulty decoder. 

is fixed to approximately 1.5% of the decoder. We observe 
that, contrary to the results of Section IVIll increasing the 
blocklength actually decreases Pe{An) in the examined EER 
region, as in the case of the non-faulty decoder. 

VIII. Conclusion 

In this paper, we studied faulty SC decoding of polar codes 
for the BEC, where the hardware-induced errors are modeled 
as additional erasures within the decoder. We showed that, 
under this model, fully reliable communication is not possible 
at any rate. Eurthermore, we showed that, in order for partial 
ordering of the synthetic channels with respect to the BEC 
parameter p to hold, the internal erasure probability of the 



Eig. 7; EER for polar codes of length N = 512,1024, 2048, 
designed for the BEC(0.5) under faulty SC decoding with 
S = 10“® and rip = n — 5 protected decoding levels. 

decoder has to be approximately smaller than the erasure 
probability of the BEC. Moreover, we derived a lower bound 
on the frame erasure rate and we used this lower bound in 
order to optimize the blocklength of polar codes under faulty 
SC decoding. Einally, we proposed an error protection scheme 
which re-enables asymptotically error-free transmission by 
protecting only a constant fraction of the decoder. Einally, our 
unequal error protection scheme was shown to significantly 
improve the performance of the faulty SC decoder for finite- 
length codes by protecting as little as 1.5% of the decoder. 
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